
DGL & RDKit | Visualizing the atom weights of a trained model with AttentiveFP

Author: 手机用户2502892403 | Source: Internet | 2023-09-06 08:34


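DGL provides many functions for cheminformatics, drug discovery, and bioinformatics tasks.

The DGL developers provide code for visualizing the atom weights of a trained model. After building a model with Attentive FP, you can visualize the atom weights of a given molecule, that is, how much each atom contributes to the target value.

Visualizing the atom weights of a trained model with Attentive FP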

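Environment setup

PyTorch: a deep learning framework
DGL: a PyTorch-based library for deep learning on graphs
RDKit: used to build molecular graphs and to draw structural formulas from string representations
MDTraj: an open-source library for analyzing molecular dynamics trajectories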
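
Before running the notebook, it can help to confirm that the four packages are importable. The snippet below is a minimal sketch using only the standard library. The import names (`torch`, `dgl`, `rdkit`, `mdtraj`) are the usual module names for these packages, and `check_packages` is a hypothetical helper introduced here for illustration.

```python
import importlib.util

# Usual import names for the packages listed above (an assumption,
# not something stated in the original post).
REQUIRED = ["torch", "dgl", "rdkit", "mdtraj"]


def check_packages(names):
    """Return {module name: True if the module can be found}."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_packages(REQUIRED).items():
        print(f"{name}: {'installed' if found else 'missing'}")
```

The check only looks the modules up; it does not import them, so it is cheap to run even when the heavy libraries are present.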

Importing libraries

%matplotlib inline
import matplotlib.pyplot as plt
import os
from rdkit import Chem
from rdkit import RDPaths
import dgl
import numpy as np
import random
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader
from torch.utils.data import Dataset
from dgl import model_zoo
from dgl.data.chem.utils import mol_to_complete_graph, mol_to_bigraph
from dgl.data.chem.utils import atom_type_one_hot
from dgl.data.chem.utils import atom_degree_one_hot
from dgl.data.chem.utils import atom_formal_charge
from dgl.data.chem.utils import atom_num_radical_electrons
from dgl.data.chem.utils import atom_hybridization_one_hot
from dgl.data.chem.utils import atom_total_num_H_one_hot
from dgl.data.chem.utils import one_hot_encoding
from dgl.data.chem import CanonicalAtomFeaturizer
from dgl.data.chem import CanonicalBondFeaturizer
from dgl.data.chem import ConcatFeaturizer
from dgl.data.chem import BaseAtomFeaturizer
from dgl.data.chem import BaseBondFeaturizer
from dgl.data.chem import one_hot_encoding
from dgl.data.utils import split_dataset
from functools import partial
from sklearn.metrics import roc_auc_score

The code comes from dgl/example.


def chirality(atom):
    try:
        return one_hot_encoding(atom.GetProp('_CIPCode'), ['R', 'S']) + \
            [atom.HasProp('_ChiralityPossible')]
    except:
        return [False, False] + [atom.HasProp('_ChiralityPossible')]


def collate_molgraphs(data):
    """Batching a list of datapoints for dataloader.

    Parameters
    ----------
    data : list of 3-tuples or 4-tuples.
        Each tuple is for a single datapoint, consisting of
        a SMILES, a DGLGraph, all-task labels and optionally
        a binary mask indicating the existence of labels.

    Returns
    -------
    smiles : list
        List of smiles
    bg : BatchedDGLGraph
        Batched DGLGraphs
    labels : Tensor of dtype float32 and shape (B, T)
        Batched datapoint labels. B is len(data) and
        T is the number of total tasks.
    masks : Tensor of dtype float32 and shape (B, T)
        Batched datapoint binary mask, indicating the
        existence of labels. If binary masks are not
        provided, return a tensor with ones.
    """
    assert len(data[0]) in [3, 4], \
        'Expect the tuple to be of length 3 or 4, got {:d}'.format(len(data[0]))
    if len(data[0]) == 3:
        smiles, graphs, labels = map(list, zip(*data))
        masks = None
    else:
        smiles, graphs, labels, masks = map(list, zip(*data))

    bg = dgl.batch(graphs)
    bg.set_n_initializer(dgl.init.zero_initializer)
    bg.set_e_initializer(dgl.init.zero_initializer)
    labels = torch.stack(labels, dim=0)

    if masks is None:
        masks = torch.ones(labels.shape)
    else:
        masks = torch.stack(masks, dim=0)
    return smiles, bg, labels, masks


atom_featurizer = BaseAtomFeaturizer(
    {'hv': ConcatFeaturizer([
        partial(atom_type_one_hot,
                allowable_set=['B', 'C', 'N', 'O', 'F', 'Si', 'P', 'S',
                               'Cl', 'As', 'Se', 'Br', 'Te', 'I', 'At'],
                encode_unknown=True),
        partial(atom_degree_one_hot, allowable_set=list(range(6))),
        atom_formal_charge,
        atom_num_radical_electrons,
        partial(atom_hybridization_one_hot, encode_unknown=True),
        lambda atom: [0],  # A placeholder for aromatic information,
        atom_total_num_H_one_hot,
        chirality],
    )})
bond_featurizer = BaseBondFeaturizer({'he': lambda bond: [0 for _ in range(10)]})

train_mols = Chem.SDMolSupplier('solubility.train.sdf')
train_smi = [Chem.MolToSmiles(m) for m in train_mols]
train_sol = torch.tensor([float(mol.GetProp('SOL')) for mol in train_mols]).reshape(-1, 1)

test_mols = Chem.SDMolSupplier('solubility.test.sdf')
test_smi = [Chem.MolToSmiles(m) for m in test_mols]
test_sol = torch.tensor([float(mol.GetProp('SOL')) for mol in test_mols]).reshape(-1, 1)

train_graph = [mol_to_bigraph(mol,
                              node_featurizer=atom_featurizer,
                              edge_featurizer=bond_featurizer) for mol in train_mols]
test_graph = [mol_to_bigraph(mol,
                             node_featurizer=atom_featurizer,
                             edge_featurizer=bond_featurizer) for mol in test_mols]


def run_a_train_epoch(n_epochs, epoch, model, data_loader,
                      loss_criterion, optimizer):
    model.train()
    total_loss = 0
    losses = []

    for batch_id, batch_data in enumerate(data_loader):
        smiles, bg, labels, masks = batch_data
        if torch.cuda.is_available():
            bg.to(torch.device('cuda:0'))
            labels = labels.to('cuda:0')
            masks = masks.to('cuda:0')

        prediction = model(bg, bg.ndata['hv'], bg.edata['he'])
        loss = (loss_criterion(prediction, labels) * (masks != 0).float()).mean()
        #loss = loss_criterion(prediction, labels)
        #print(loss.shape)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

        losses.append(loss.data.item())

    #total_score = np.mean(train_meter.compute_metric('rmse'))
    total_score = np.mean(losses)
    print('epoch {:d}/{:d}, training {:.4f}'.format(epoch + 1, n_epochs, total_score))
    return total_score


model = model_zoo.chem.AttentiveFP(node_feat_size=39,
                                   edge_feat_size=10,
                                   num_layers=2,
                                   num_timesteps=2,
                                   graph_feat_size=200,
                                   output_size=1,
                                   dropout=0.2)

train_loader = DataLoader(dataset=list(zip(train_smi, train_graph, train_sol)),
                          batch_size=128, collate_fn=collate_molgraphs)
test_loader = DataLoader(dataset=list(zip(test_smi, test_graph, test_sol)),
                         batch_size=128, collate_fn=collate_molgraphs)

loss_fn = nn.MSELoss(reduction='none')
optimizer = torch.optim.Adam(model.parameters(), lr=10 ** (-2.5), weight_decay=10 ** (-5.0),)
n_epochs = 100
epochs = []
scores = []
for e in range(n_epochs):
    score = run_a_train_epoch(n_epochs, e, model, train_loader, loss_fn, optimizer)
    epochs.append(e)
    scores.append(score)
model.eval()
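
Two details of the code above are easy to sanity-check in plain Python. The first is `node_feat_size=39`, which should equal the total length of the concatenated atom features. The per-featurizer lengths below assume DGL's default allowable sets for hybridization (5 types) and total hydrogen count (0 to 4), and assume that `encode_unknown=True` adds one extra slot. The second is the masked loss: the element-wise squared error is multiplied by the mask and then averaged over all entries, including the masked ones. `FEATURE_SIZES` and `masked_mse` are illustrative names, not part of DGL.

```python
# Length contributed by each entry of the ConcatFeaturizer in `atom_featurizer`.
# Sizes marked "assumed" rely on DGL's default allowable sets.
FEATURE_SIZES = {
    "atom_type_one_hot": 15 + 1,          # 15 listed elements + 1 unknown slot
    "atom_degree_one_hot": 6,             # degrees 0..5
    "atom_formal_charge": 1,
    "atom_num_radical_electrons": 1,
    "atom_hybridization_one_hot": 5 + 1,  # assumed 5 default types + 1 unknown slot
    "aromatic_placeholder": 1,            # lambda atom: [0]
    "atom_total_num_H_one_hot": 5,        # assumed default range 0..4
    "chirality": 3,                       # R, S, and the _ChiralityPossible flag
}


def masked_mse(predictions, labels, masks):
    """Mean of squared errors with masked-out entries set to zero.

    Mirrors (loss_criterion(prediction, labels) * (masks != 0).float()).mean()
    for flat Python lists: the divisor is the total number of entries,
    not the number of unmasked ones.
    """
    terms = [
        (p - y) ** 2 * (1.0 if m != 0 else 0.0)
        for p, y, m in zip(predictions, labels, masks)
    ]
    return sum(terms) / len(terms)


if __name__ == "__main__":
    print(sum(FEATURE_SIZES.values()))
    print(masked_mse([1.0, 2.0, 3.0], [1.0, 0.0, 0.0], [1, 1, 0]))
```

Under these assumptions the sizes add up to 39, matching `node_feat_size`. The bond featurizer returns ten zeros per bond, which matches `edge_feat_size=10`.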

Importing the libraries used for molecule visualization

import copy
from rdkit.Chem import rdDepictor
from rdkit.Chem.Draw import rdMolDraw2D
from IPython.display import SVG
from IPython.display import display
import matplotlib
import matplotlib.cm as cm

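Defining the visualization function

The code comes from the DGL library.
The DGL model has a get_node_weight option that returns the node weights of the graph. The model has two GRU layers (it was built with num_timesteps=2), so the timestep must be 0 or 1; the code below uses 0.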
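
To make the idea of node weights concrete, the sketch below assumes that the weights at each timestep are attention coefficients obtained by a softmax over the atoms of one molecule, so they are non-negative and sum to 1. The scores are made-up numbers, and `softmax` is a plain-Python helper written for this illustration, not the DGL implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical per-atom attention scores for a 4-atom molecule,
# one list per readout timestep (two timesteps, so valid indices are 0 and 1).
scores_per_timestep = [
    [0.2, 1.5, -0.3, 0.0],
    [1.0, 1.0, 1.0, 1.0],
]
weights_per_timestep = [softmax(s) for s in scores_per_timestep]

timestep = 0
# Guard against an out-of-range timestep before indexing.
assert 0 <= timestep < len(weights_per_timestep)
atom_weights = weights_per_timestep[timestep]

if __name__ == "__main__":
    print([round(w, 3) for w in atom_weights])
    print(sum(atom_weights))
```

In this toy example the second atom has the largest score and therefore the largest weight, while equal scores yield equal weights.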

def drawmol(idx, dataset, timestep):
    smiles, graph, _ = dataset[idx]
    print(smiles)
    bg = dgl.batch([graph])
    atom_feats, bond_feats = bg.ndata['hv'], bg.edata['he']
    if torch.cuda.is_available():
        print('use cuda')
        bg.to(torch.device('cuda:0'))
        atom_feats = atom_feats.to('cuda:0')
        bond_feats = bond_feats.to('cuda:0')

    _, atom_weights = model(bg, atom_feats, bond_feats, get_node_weight=True)
    assert timestep  # the source text is cut off at this point

Drawing the molecules of the test dataset

The model predicts solubility. In the coloring, red indicates a positive influence on solubility and blue a negative one.

target = test_loader.dataset
for i in range(len(target)):
    mol, aw, svg = drawmol(i, target, 0)
    display(SVG(svg))
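
One way to turn per-atom weights into red and blue shading is to rescale the weights to the range 0 to 1 and interpolate between blue, white, and red. The sketch below is pure Python and is only an assumption about how such a mapping could look; the notebook imports `matplotlib.cm`, so it may well rely on a matplotlib colormap instead. `normalize` and `blue_white_red` are hypothetical helper names, and the weights are made up.

```python
def normalize(weights):
    """Min-max rescale weights to [0, 1]; constant input maps to 0.5."""
    low, high = min(weights), max(weights)
    if high == low:
        return [0.5 for _ in weights]
    return [(w - low) / (high - low) for w in weights]


def blue_white_red(value):
    """Map a value in [0, 1] to an (r, g, b) tuple: 0 is blue, 0.5 white, 1 red."""
    if value < 0.5:
        t = value / 0.5            # 0 -> blue, 1 -> white
        return (t, t, 1.0)
    t = (value - 0.5) / 0.5        # 0 -> white, 1 -> red
    return (1.0, 1.0 - t, 1.0 - t)


# Made-up weights for a 3-atom molecule, keyed by atom index.
weights = [0.1, 0.3, 0.6]
atom_colors = {i: blue_white_red(v) for i, v in enumerate(normalize(weights))}

if __name__ == "__main__":
    for idx, color in atom_colors.items():
        print(idx, color)
```

A dictionary of this shape (atom index to RGB tuple) is the form that RDKit's `rdMolDraw2D` drawers accept as `highlightAtomColors` when drawing a molecule.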
 
 

References
 1. https://github.com/dmlc/dgl/tree/master/apps/life_sci
 2. https://github.com/dmlc/dgl/blob/master/python/dgl/model_zoo/chem/attentive_fp.py
 3. https://pubs.acs.org/doi/full/10.1021/acs.jcim.9b00387




    
        